Overview

Dataset Statistics

Number of Variables 9
Number of Rows 64000
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 6603
Duplicate Rows (%) 10.3%
Total Size in Memory 14.7 MB
Average Row Size in Memory 241.1 B
Variable Types
  • Numerical: 2
  • Categorical: 7
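
The duplicate percentage and the variable-type split can be cross-checked from the counts above. A minimal sketch, using only the reported figures (the variable names are illustrative):

```python
# Figures copied from the Dataset Statistics block above.
n_rows = 64000
n_duplicates = 6603
n_numerical, n_categorical = 2, 7

# Share of rows flagged as duplicates, in percent.
duplicate_pct = 100 * n_duplicates / n_rows
# The two variable types should add up to the reported number of variables.
n_variables = n_numerical + n_categorical

print(f"duplicates: {duplicate_pct:.2f}%")
print(f"variables: {n_variables}")
```

The unrounded share is about 10.32%, which is the figure quoted under Dataset Insights; the 10.3% in the statistics block is the same value to one decimal place.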

Dataset Insights

  • Skewed: history is skewed
  • Duplicates: dataset has 6603 (10.32%) duplicate rows
  • Constant Length: used_discount has constant length 1
  • Constant Length: used_bogo has constant length 1
  • Constant Length: is_referral has constant length 1
  • Constant Length: conversion has constant length 1

Variables


recency

numerical

Approximate Distinct Count 12
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1024000
Mean 5.7637
Minimum 1
Maximum 12
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • recency is skewed right (γ1 = 0.1393)

Quantile Statistics

Minimum 1
5-th Percentile 1
Q1 2
Median 5
Q3 9
95-th Percentile 11
Maximum 12
Range 11
IQR 7
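
Range and IQR are derived from the other quantile rows rather than measured separately. A small sketch of that arithmetic for recency, with values copied from the table above:

```python
# Values copied from the recency quantile table.
minimum, q1, q3, maximum = 1, 2, 9, 12

value_range = maximum - minimum  # distance between the extremes
iqr = q3 - q1                    # width of the middle 50% of rows

print(value_range, iqr)
```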

Descriptive Statistics

Mean 5.7637
Standard Deviation 3.5076
Variance 12.3032
Sum 368879
Skewness 0.1393
Kurtosis -1.3567
Coefficient of Variation 0.6086
  • recency is not normally distributed (p-value 1.588466696902521e-05)
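
Several of the descriptive statistics follow from the mean, the standard deviation and the row count. The sketch below recomputes them from the rounded figures shown above, so small differences in the last digit are expected:

```python
# Rounded figures copied from the recency statistics above.
n_rows = 64000
mean, std = 5.7637, 3.5076

variance = std ** 2                    # reported: 12.3032
coefficient_of_variation = std / mean  # reported: 0.6086
approx_sum = mean * n_rows             # reported: 368879

print(round(variance, 4))
print(round(coefficient_of_variation, 4))
print(round(approx_sum))
```

The recomputed variance and sum differ slightly from the report only because the mean and standard deviation are already rounded to four decimals here.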

history

numerical

Approximate Distinct Count 34833
Approximate Unique (%) 54.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 1024000
Mean 242.0857
Minimum 29.99
Maximum 3345.93
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • history is skewed right (γ1 = 2.4237)

Quantile Statistics

Minimum 29.99
5-th Percentile 29.99
Q1 64.66
Median 158.11
Q3 325.6575
95-th Percentile 747.2625
Maximum 3345.93
Range 3315.94
IQR 260.9975

Descriptive Statistics

Mean 242.0857
Standard Deviation 256.1586
Variance 65617.2326
Sum 1.5493×10^7
Skewness 2.4237
Kurtosis 9.3459
Coefficient of Variation 1.0581
  • history is not normally distributed (p-value 4.054822229088002e-18)
  • history has 3593 outliers
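
The report does not state how outliers are defined. Assuming the common 1.5 × IQR (Tukey fence) rule, which is an assumption and not something the report confirms, the fences for history can be sketched from the quantiles above:

```python
# Figures copied from the history section above.
q1, q3 = 64.66, 325.6575
p95 = 747.2625
n_rows, n_outliers = 64000, 3593

iqr = q3 - q1
# Assumed rule: values beyond 1.5 * IQR from the quartiles count as outliers.
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outlier_share = n_outliers / n_rows

print(f"fences: [{lower_fence:.2f}, {upper_fence:.2f}]")
print(f"outlier share: {outlier_share:.2%}")
```

Under this assumption the lower fence is negative, below the minimum of 29.99, so any outliers would lie on the high side. The 95th percentile is above the upper fence, which means more than 5% of rows exceed it; that is in line with 3593 outliers (about 5.6% of rows).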

used_discount

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4224000

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 1
2nd row 1
3rd row 0
4th row 1
5th row 1

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 64000
  • The top 2 categories (1, 0) take over 50.0%
  • used_discount has words of constant length

used_bogo

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4224000

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 1
3rd row 1
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 64000
  • The top 2 categories (1, 0) take over 50.0%
  • used_bogo has words of constant length

zip_code

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4595104

Length

Mean 6.7985
Standard Deviation 1.9898
Median 5
Minimum 5
Maximum 9

Sample

1st row Surburban
2nd row Rural
3rd row Surburban
4th row Rural
5th row Urban

Letter

Count 435104
Lowercase Letter 371104
Space Separator 0
Uppercase Letter 64000
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Surburban, Urban) take over 50.0%
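
The Length and Letter figures are enough to recover one category count. The sketch below assumes the three distinct values are exactly those seen in the sample rows: "Rural" and "Urban" (5 characters each) and "Surburban" (9 characters).

```python
import math

# Figures copied from the zip_code section above.
n_rows = 64000
total_letters = 435104      # Letter "Count"
short_len, long_len = 5, 9  # assumed: "Rural"/"Urban" vs "Surburban"

# Every row contributes at least short_len characters; each long value adds 4 more.
n_long = (total_letters - short_len * n_rows) // (long_len - short_len)
n_short = n_rows - n_long

# Cross-check against the reported Length statistics.
mean_length = total_letters / n_rows
p_long = n_long / n_rows
std_length = (long_len - short_len) * math.sqrt(p_long * (1 - p_long))

print(n_long, n_short)
print(round(mean_length, 4), round(std_length, 4))
```

Under these assumptions about 45% of rows carry the 9-character value, which also explains why the median length is 5.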

is_referral

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4224000

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 1
3rd row 1
4th row 1
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 64000
  • The top 2 categories (1, 0) take over 50.0%
  • is_referral has words of constant length

channel

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4477900

Length

Mean 4.9672
Standard Deviation 2.7759
Median 5
Minimum 3
Maximum 12

Sample

1st row Phone
2nd row Web
3rd row Web
4th row Web
5th row Web

Letter

Count 317900
Lowercase Letter 253900
Space Separator 0
Uppercase Letter 64000
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Web, Phone) take over 50.0%

offer

categorical

Approximate Distinct Count 3
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4821709

Length

Mean 10.3392
Standard Deviation 3.3019
Median 8
Minimum 8
Maximum 15

Sample

1st row Buy One Get One
2nd row No Offer
3rd row Buy One Get One
4th row Discount
5th row Buy One Get One

Letter

Count 576242
Lowercase Letter 426775
Space Separator 85467
Uppercase Letter 149467
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (Buy One Get One, Discount) take over 50.0%
  • The most frequent word (one) occurs over 2.0 times as often as the second most frequent word (buy)
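
The character counts above determine how many rows fall in each offer category. The sketch assumes the three distinct values are exactly those in the sample rows: "Buy One Get One" (15 characters, 3 spaces, 4 uppercase), "No Offer" (8 characters, 1 space, 2 uppercase) and "Discount" (8 characters, no spaces, 1 uppercase).

```python
# Figures copied from the offer section above.
n_rows = 64000
letters, spaces, uppercase = 576242, 85467, 149467

total_chars = letters + spaces
# Each row has at least 8 characters; "Buy One Get One" adds 7 more.
n_bogo = (total_chars - 8 * n_rows) // (15 - 8)
# Spaces: 3 per "Buy One Get One", 1 per "No Offer", 0 per "Discount".
n_no_offer = spaces - 3 * n_bogo
n_discount = n_rows - n_bogo - n_no_offer

# Independent check against the uppercase-letter count.
uppercase_check = 4 * n_bogo + 2 * n_no_offer + n_discount

print(n_bogo, n_discount, n_no_offer)
print(uppercase_check == uppercase)
```

Under these assumptions the three offers are almost evenly split, with "Buy One Get One" and "Discount" marginally ahead of "No Offer", which agrees with the top-2 insight above.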

conversion

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 4224000
  • The most frequent category (0) occurs over 5.81 times as often as the second most frequent category (1)

Length

Mean 1
Standard Deviation 0
Median 1
Minimum 1
Maximum 1

Sample

1st row 0
2nd row 0
3rd row 0
4th row 0
5th row 0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 64000
  • The top 2 categories (0, 1) take over 50.0%
  • conversion has words of constant length
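
The class-imbalance ratio reported for conversion can be turned into an approximate conversion rate. Because the report says "over 5.81", the figures below are upper bounds under that reading, not exact counts:

```python
# Figures copied from the report.
n_rows = 64000
ratio = 5.81  # count of 0 divided by count of 1, reported as "over 5.81"

# With only two categories, share(1) = 1 / (1 + ratio).
conversion_rate = 1 / (1 + ratio)
approx_converted = conversion_rate * n_rows

print(f"conversion rate: about {conversion_rate:.2%}")
print(f"converted rows: about {approx_converted:.0f}")
```

A positive class of roughly 15% is worth keeping in mind when choosing evaluation metrics for any model trained on this column.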

Interactions

Correlations

Missing Values